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Abstract 

Background: Using tine piggyBac-mediated GAL4/UAS transgenic system establislied in tlie sill<wornn, Bombyx mori, 
we liave previously reported tliat overexpression of tlie Rasl^^ oncogene specifically in the posterior silk gland 
(PSG) innproved cell growth, fibroin synthesis, and thus silk yield. However, the detailed nnolecular nnechanisnn 
rennains to be fully elucidated. To achieve this goal, lllunnina sequencing was used in the present study to connpare 
the transcriptomes of the to/^^-overexpressed and wildtype PSGs. 

Results: The transcriptonnic sequencing results in 56 nnillion reads following filtering steps. Most of the reads 
(-70%) are successfully mapped to the Bombyx genome. The mapped reads are situated within at least 9,133 
predicted genes, covering 62.46% genes of the Bombyx genome. GO annotation shows that 2512 of the 2,636 
differentially expressed genes (DEGs) are mostly distributed in metabolic process, cell and cell part, and binding, 
and KEGG annotation shows that 1,941 DEGs are mapped into 277 pathways. Importantly, Rasl^^ overexpression in 
the PSG upregulated many DEGs distributed in "pathways in cancer", "insulin signaling pathway", and "MAPK 
signaling pathway" as well as "purine metabolism" and "pyrimidine metabolism". Transcriptional regulation of these 
DEGs was verified by quantitative real-time PGR. Moreover, injection of small-molecule chemical inhibitors of the 
Rasl downstream effectors into the /?as/^^-overexpressed silkworms revealed that both Raf-MAPK and PI3K-T0RC1 
pathways are required for the Rasl -induced DEG expression. 

Conclusion: The transcriptomic analysis illustrates that, apart from phosphorylational regulation, Rasl activates its 
downstream Raf-MAPK and PI3K-T0RC1 pathways at the transcriptional level. Meanwhile, Rasl increases DNA content 
and induces endoreplication, at least in part, by upregulating genes in "nucleotide metabolism" and "cell cycle". 
This study provides further insights into the molecular mechanism of how Rasl^^ overexpression in the PSG 
improves silk yield. 



Background 

As a traditional agricultural industry, sericulture is eco- 
nomically important to China and several other coun- 
tries. The domesticated silkworm, Bombyx mori, is the 
most important insect species for sericulture. In the 
past, sericulture has been greatly advanced by applying 
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the hybrid breeding technique to Bombyx. However, it 
has reached a plateau during the last decades, mostly 
due to the inherent threshold of this technique. To 
break through the bottleneck of silk production, new 
breeding techniques, such as the molecular breeding 
technique, should be developed. It has been hypothe- 
sized that fibroin production in the Bombyx posterior 
silk gland (PSG) is directly proportional to silk yield and 
determined by its gland size and protein synthesis [1], 
making it possible to improve silk yield by genetic ma- 
nipulation of the PSG [2]. On the other hand, Bombyx 
has been used as a model lepidopteran insect for a long 
time [3,4]. In terms of protein synthesis, its PSG is one 
of the most efficient organs in animals. Therefore, 
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studying the molecular mechanism controlling fibroin 
synthesis in the Bombyx PSG is of great value for both 
applied and basic research. 

Using the piggyBac-rnQdidXed GAL4/UAS transgenic 
system established in Bombyx [5-7], we specifically over- 
expressed a constitutively active form of Rasl (Rasl^^) 
in the PSG. In the transgenic silkworm, Fil-GAL4 > UAS- 
Rasl^^ (Fil > Rasl^^), Rasl^^ overexpression increases 
gland size and protein synthesis in the PSG, leading to 
silk yield improvement by 60% [2]. This study not only 
provides an application prospect to silk yield improve- 
ment in sericulture, but also supports the previous hy- 
pothesis that fibroin production is determined by gland 
size and protein synthesis in the PSG [1]. It is certain 
that Rasl and its downstream Raf-MAPK and PI3K- 
TORCl pathways play critical roles in regulating fibroin 
production [2], while the detailed molecular mechanism 
remains to be fully elucidated. The completed Bombyx 
genome sequence [8,9] makes it possible to use functional 
genomics, such as proteomics and transcriptomics, to 
achieve the above goal. Using 2D-DIGE-MS/MS analysis, 
we previously compared the proteomic profiles of the 
7?<3^57^^-overexpressed and wild type (WT) PSGs. Further 
studies revealed that, via the downstream Raf-MAPK and 
PI3K-TORC1 pathways, Rasl^^ upregulates bcpi, which 
inhibits cathepsin activity thus preventing PSG destruc- 
tion during metamorphosis [10]. Transcriptomics could 
be an alternative approach for analyzing how Rasl^^ over- 
expression in the PSG improves fibroin production. 

In terms of transcriptomic tools, a whole-genome 
microarray containing 22,987 oligonucleotides of 70-mers 
that cover the presently known and predicted genes in the 
Bombyx genome was designed on the basis of the whole- 
genome sequences [11]. This microarray has been used to 
survey the silkworm gene expression patterns in multiple 
tissues, at different developmental stages, and under 
various conditions or treatments [11-15]. RNA-Seq 
(also known as Next Generation Sequencing), including 
Roche/454 pyrosequencing, Illumina-Solexa sequencing, 
and Applied Biosystems SOLID sequencing, has led to a 
revolution in genomics and provided cheaper and faster 
delivery of sequencing information [16]. The Illumina- 
Solexa sequencing strategy was adopted for the sequen- 
cing of 40 Bombyx genomes from 29 phenotypically and 
geographically diverse domesticated silkworm lines and 
11 wild silkworms from various mulberry fields in 

Table 1 Raw data preprocessed results 

Sample Raw data 

Read Base Read 



China. This comprehensive study constructs a genome- 
wide genetic variation map which shed light on the history 
of silkworm domestication [9]. RNA-Seq also led to the 
identification of new exons, novel genes, alternative spli- 
cing genes, and trans-s^Xicmg events in Bombyx [17,18]. 

In this study, Illumina-Solexa sequencing revealed 
2,636 differentially expressed genes (DEGs) in the 
i?<3^5i^^-overexpressed and WT PSGs. Confirmed by 
quantitative real-time PCR (qPCR), the transcriptomic 
analysis shows that Rasl increases gland size, protein 
synthesis, and DNA content in the PSG at the transcrip- 
tional level. 



Results 

Identification of DEGs using RNA-seq 

For better understanding the molecular mechanism how 
Rasl^^ overexpression increases fibroin production in the 
Bombyx PSG, we compared the transcriptomes of the 
i?<3^5i^^-overexpressed and WT PSGs at the early wander- 
ing stage. The RNA-seq raw data were deposited to NCBI 
SRA with the accession number SRP026709 (http://www. 
ncbi.nlm.nih.gov/sra/?term=SRP026709). The accession 
numbers for the two WT PSG RNA-seq libraries are 
SRX320122 and SRX320124, and those for the two Rasf^- 
overexpressed PSG RNA-seq libraries are SRX320125 and 
SRX320126. The RNA-seq raw data of SRX320122 and 
SRX320124 were combined for raw data processing, so were 
SRX320125 and SRX320126. Using the pair-end Illumina- 
Solexa sequencing strategy, we obtained 69,662,027 raw 
reads, containing about 6.9 gigabases with an average read 
length of 101 nucleotides. The raw data was preprocessed, 
and 13,238,008 (19%) low quality reads were removed. 
The remaining 56,424,019 reads, with an average length of 
90.71 nucleotides (Table 1), were used to map the Bombyx 
genome (release_2.0) [19] using TopHat [20]. 

In total, 39,967,028 reads were mapped to silkworm 
genome database (silkgenome.fa) and silkworm gene data- 
base (sill<worm_glean.gff), with approximately 2.2 and 1.8 
million reads from the ^"^-overexpressed and WT 
PSGs, respectively. The completed Bombyx genome con- 
tains 14,623 unigenes [8,9]. Using the Fragments Per kb 
Per Million Reads (FPKM) method [21], we have found 
that 9,133 unigenes are expressed, with 6,962 and 6,429 
unigenes in the ^^-overexpressed and WT PSGs, res- 
pectively. The mapping coverage is 62.46% genes of the 



Valid data Valid ratio (reads) 

Base Average length 



WT 33218225 3321822500 26436677 2385302793 90.38 79.79% 

Fil>Ras1*^'^ 36443802 3644380200 29987342 2732788395 91.12 81.76% 

All 69662027 6966202700 56424019 5118091323 90.71 81.00% 
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silkworm genome, showing a high confidence of RNA-seq 
in this study (Table 2). 

Importantly, RNA-Seq revealed 2,636 DEGs, with 1708 
upregulated and 938 downregulated genes in the Rasl^^- 
overexpressed PSG compared to the WT PSG (Figure 1). 
Next, the DEGs were subjected to functional annotation 
and qPCR verification. 

Functional annotation of DEGs 

Gene ontology (GO) assignments were used to classify the 
functions of DEGs revealed by transcriptomic analysis. The 
DEGs were termed by GO ontology in three categories, 
namely biological process, cellular component, and molecu- 
lar function. In total, 2512 DEGs (95.3% of 2636) were anno- 
tated in 60 GO functional groups (Figure 2). In the category 
of biological process, over 43% of DEGs were distributed in 
"metabolic process", nearly 40% DEGs were classified into 
"cellular process", while very few DEGs were found in 
"growth", "carbon utilization", "viral reproduction", "rhyth- 
mic process", and "locomotion" (Figure 2A). In the cate- 
gory of cellular component, over 35% of DEGs were 
distributed in "cell" and "cell part", however, very small 
numbers of DEGs were found in "virion part", "synapse", 
and "extracellular matrix part" (Figure 2B). In the category 
of molecular function, the terms "binding" and "catalytic 
activity" enriched 32% and 30% of DEGs, respectively. 
By contrast, few DEGs were distributed in "channel 
regulator activity", "receptor regulator activity", and 
"metallochaperone activity" (Figure 2C). 

To identify the biological pathways that are active in 
the 7?<3^57 ^^-overexpressed PSG, we mapped the DEGs to 
the reference canonical pathways in Kyoto Encyclopedia 
of Genes and Genomes (KEGG) [22]. Among the 2636 
DEGs, 1,941 sequences predicted to encode enzymes 
with enzyme commission (EC) numbers were mapped 
into 277 KEGG pathways. The top 25 KEGG pathways 
are shown in Table 3. The top 5 KEGG pathways with 
most representations by the DEGs were "purine metabo- 
lism", "spliceosome", "pathways in cancer", "RNA trans- 
port", and "HTLV-1 infection", implying that these 
diverse metabolic processes are active in the Rasl^^- 
overexpressed PSG. "Purine metabolism" is essential, 
providing components of the nucleotides, DNA and 
RNA, and the energy currency of the cell, ATP [23]. In 
parallel, "Pyrimidine metabolism" also ranks on the top 
6 KEGG pathway (Table 3). In the Rasl^^-overexpressed 



PSG, the DNA content is nearly doubled and endoreplica- 
tion is enhanced in comparison with the WT PSG, indi- 
cating that more active nucleotide metabolism is required 
[2]. "Spliceosome" is complex molecular machinery in- 
volved in removal of introns from mRNA precursors [24] . 
A lot of DEGs in this pathway suggest occurrence of high 
frequency of mRNA splicing for translation in the Rasl^^- 
overexpressed PSG. Interestingly, a large number of DEGs 
are found distributed in "pathways in cancer". As an import- 
ant oncogene, Ras plays important roles in both normal de- 
velopment and aberrant biological processes, such as 
tumorigenesis and developmental disorders [25]. It is lil<ely 
that, apart from the phosphorylational regulation, Ras could 
also activate "pathways in cancer" at the transcriptional level. 
"RNA transport" from the nucleus to the cytoplasm is fun- 
damental for gene expression regulation. Most eukaryotic 
RNAs are produced in the nucleus by RNA polymerase I, II, 
or III. The RNA molecules undergo a variety of posttran- 
scriptional processing events, and a majority of them are lo- 
calized to their functional sites in the cell [26]. In 
accordance with this result, ribosome biogenesis for mRNA 
translation was stimulated in the i^^^^si ^"^-overexpressed 
PSG [2]. Unexpectedly, some DEGs were annotated into 
"HTLV-1 infection pathway", but the sillcworm cannot be af- 
fected by this human virus. Our preliminary data shows that 
virus-resistance is enhanced when Rasl^^ is globally overex- 
pressed in the transgenic silkworm, Actin3-GAL4 > UAS- 
Rasl^"^, suggesting that the DEGs in "HTLV-1 infection 
pathway" might play a role in antivirus in Bombyx, 

In short, these annotations provide a valuable insight 
into the specific processes, functions, and pathways and 
facilitate the identification of DEGs resulted from the 
Ras 1^^ -overexpressed PSG. Next, the DEGs in several 
important and top KEGG pathways were chosen for 
qPCR verification (Table 3). 

qPCR verification of DEGs in "pathways in cancer", "insulin 
signaling pathway", and "MARK signaling pathway" 

In our previous study, we have determined that Rasl^^ 
overexpression in the PSG increases Ras activity, resulting 
in phosphorylation of the Ras downstream effector proteins, 
Raf and PI3K110, which in turn activate the Raf-MAPK and 
PI3K-TORC1 pathways, respectively [2]. Surprisingly, 
among all the 277 KEGG pathways, a large number of 
DEGs are distributed in "pathways in cancer", "insulin 
signaling", and "MAPK signaling pathway" (Additional file 



Table 2 Evaluation of valid reads mapped to reference genome of each sample 

QC data IVlapp_reads IVlapp% Unigenes Junc_reads Junc% 

WT 26436677 17583378 66.54% 6429 5694420 21.62% 

Fil>Ras1*^'^ 29987342 22383650 73.35% 6962 7878212 25.33% 

All 56424019 39967028 70.83% 9133 13572632 24.05% 
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Figure 1 Gene expression in RasT^'^-overexpressed and WT 
PSGs. All reads were aligned to predicted genes and are shown as 
log 10 values derived from cDNAs of the /?as/^^-overexpressed and 
WT PSGs (F-test, P, <2.2x 10"^^). The Rasl-upregulated genes and 
downregulated genes are marked with red and green 
spots, respectively. 



1: Figure SI, Additional file 2: Figure S2 and Additional file 
3: Figure S3), which rank on the top 3, 9, and 15 KEGG 
pathways containing 49, 35, and 32 DEGs, respectively 
(Table 4). 

Since these three KEGG pathways are highly related to 
each other, we analyzed their common DEGs and veri- 
fied them by qPCR (Figure 3). First, 4 DEGs were found 
in all the three pathways, including cr/c, mek, jnk, and 
erk. qPCR data were in agreement with the transcrip- 
tomic results, showing that all these 4 genes are upregu- 
lated in the i^^^^si^'^-overexpressed PSG (Figure 4A). 
Second, 5 DEGs were observed in both "pathways in 
cancer" and "insulin signaling pathway", including piSkl, 
piSks, cblly cbl2, and cbl3 (Figure 4B). The transcrip- 
tomic results and qPCR data showed that piSks^ cbll, 
cbl2, and cbl3 are all upregulated by Rasl^"^, whereas 
the qPCR data of piSkl did not match its transcriptomic 
result suggesting it might also be upregulated by Rasl^^. 
Third, there were 3 DEGs in both "pathways in cancer" 
and "MAPK signaling pathway", including fgfrl, pkc, and 
fgfr2. The transcriptomic results and qPCR data showed 
the fgfrl and pkc are upregulated and downregulated by 
Rasl^^, respectively, while fgfr2 might be downregulated 
by Rasl^^ (Figure 4C). Finally, pka, a single DEG in 
both "insulin signaling pathway" and "MAPK signaling 
pathway", was downregulated by Rasl^^ (Figure 4D). 

We randomly selected some DEGs in each individual 
pathway for qPCR verification (Figure 5). In "pathways 
in cancer", both the transcriptomic results and qPCR 
data revealed that 7 DEGs, including cdk4/6y ptenU fu, 



cdk2, raUfaks, and elongb, are all upregulated by Rasl , 
while fakl is downregulated. Although the transcriptomic 
results suggested that ptens and smo are upregulated by 
Rasl^"^, qPCR data showed that they might be downreg- 
ulated (Figure 5A). In "insulin signaling pathway", 7 
DEGs, including rheb, fasl, fass, apkc, apkc, eif4e, and 
ampk, are all upregulated by Rasl^^, while phk is down- 
regulated, and she might be upregulated (Figure 5B). In 
"MAPK signaling pathway", p3^ and mapkapk are up- 
regulated by Rasl^"^, while hsp70s and tak are downreg- 
ulated, and daxx might be downregulated (Figure 5C). 

qPCR verification of DEGs in "purine metabolism" and 
"pyrimidine metabolism" 

Interestingly, abundant DEGs were annotated in two major 
nucleotide metabolism pathways, "purine metabolism" and 
"pyrimidine metabolism" (Additional file 4: Figure S4 and 
Additional file 5: Figure S5), ranking on the top 1 and 6 
KEGG pathways which contain 67 and 42 DEGs, respec- 
tively (Table 3). 

We thus verified the above-mentioned hypothesis that 
Ras might activate nucleotide metabolism by qPCR verifi- 
cation of some randomly selected DEGs in both "purine 
metabolism" and "pyrimidine metabolism" (Figure 6). We 
first analyzed 10 of the 28 common DEGs in both path- 
ways, including pole4, pole2, rpbS, rpclO, rpb4, rpc37, apf, 
itpa, rpc2S, and ntSe, Different from the transcriptomic 
results, qPCR data suggest that rpc25 and ntSe might 
be upregulated rather than downregulated by Rasl 
(Figure 6A). We then analyzed 5 of the 39 DEGs only 
in "purine metabolism", including adk, allc, prps, pde, and 
gart Although gart expression was inconsistent between 
its transcriptomic and qPCR data, all the other DEGs are 
upregulated (Figure 6B). In addition, urhl, one of DEGs 
only in "pyrimidine metabolism" is also upregulated 
(Figure 6C). Taken together, Rasl^^ overexpression in the 
PSG upregulates most, if not all, DEGs in "purine meta- 
bolism" and "pyrimine metabolism". 

Identification of Ras1 downstream signals in regulating 
DEGs by small-molecule inhibitor treatments 

We next investigated which downstream pathway(s) that 
Rasl utilizes to regulate DEGs in the Bombyx PSG by 
injecting small-molecule inhibitors of the Ras down- 
stream effectors (Rafi, Raf inhibitor; LY294002, PI3K in- 
hibitor; rapamycin, TORCl inhibitor; 15 (ig/larva) into 
the Ras l^^'Overexpressed silkworm larvae [10]. Some 
Ras l^^'Upreguldited DEGs, which are consistent in both 
transcriptomic results and qPCR data, were chosen for 
inhibitor treatment experiments by qPCR analysis to 
examine their expression levels. 

First, we detected the common DEGs annotated in 
"pathways in cancer", "insulin signaling pathway", and 
"MAPK signaling pathway" by qPCR (Figure 7A-C). The 
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Figure 2 Histogram presentation of Gene Ontology classification of DEGs in /?c/s 7 *"'^-overexpressed and WT PSGs. The results are 
summarized in three main categories: biological process (A), cellular component (B), and molecular function (C). The right y-axis indicates the 
number of genes in a category. The lefty-axis indicates percentage of DEGs in a specific category. Different color represents different percentage: 
dark blue, 0-5%; dark red, 5-10%, dark purple, 10-20%; green, 20-30%; orange, 30-40%; red, over 40%. 
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Table 3 A comparison between qPCR verification results and transcriptomic data of DEGs in five KEGG pathways 



Pathway Number of DEGs Number of Ratio of Number of Ratio of 

verified by qPCR matched DEGs matched DEGs unmatched DEGs unmatched DEGs 



Pathways in cancer 


22 


18 


81.8% 


4 


1 8.2% 


Insulin signaling 


18 


16 


88.9% 


2 


11.1% 


MAPK signaling 


13 


11 


84.6% 


2 


15.4% 


Purine metabolism 


15 


12 


80.0% 


3 


20.0% 


Pyrimidine metabolism 


11 


8 


72.7% 


3 


27.3% 



mRNA levels of mek, erk, and jnk distributed in all 
the three pathways were decreased to 10-20% by 
Rafi and 20-40% by LY294002, whereas rapamycin treat- 
ment showed weaker inhibitory effects (40-60%) 
(Figure 7A). For piSks, cbl2, and cblS^ the three DEGs 
presented in both "pathways in cancer" and "insulin 
signaling pathway", LY294002 and rapamycin showed 
the strongest and weakest inhibitory effects, respectively 
(Figure 7B). By contrast, rapamycin strongly inhibited 



expression of fgfrl, the DEGs distributed in "pathways in 
cancer" and "MAPK signaling" (Figure 7C). 

Second, we detected the individual DEGs anno- 
tated in "pathways in cancer" (Figure 7D), "insulin 
signaling pathway" (Figure 7E), and "MAPK signaUng 
pathway" (Figure 7F) by qPCR. For most of the 
DEGs, Raf inhibitor exhibited the strongest inhibi- 
tory effects, while rapamycin showed little to no in- 
hibitory effects. 



Table 4 Number of KEGG orthologs (KO) in pathways with top mapped KOs 



Pathway 


Pathway type 


DEGs 


ko00230. Purine metabolism 


Metabolism/Nucleotide metabolism 


67 


ko03040, Spliceosome 


Genetic information processing^ranscription 


52 


ko05200. Pathways in cancer 


Human diseases/Cancers 


49 


ko03013, RNA transport 


Genetic information processing^ranslation 


47 


ko05166, HTLV-I infection 


Human diseases/Infectious diseases: Viral 


44 


ko00240, Pyrimidine metabolism 


Metabolism/Nucleotide Metabolism 


42 


ko04110, Cell cycle 


Cellular processes/Cell growth and death 


42 


ko04510, Focal adhesion 


Cellular processes/Cell communication 


38 


ko04910. Insulin signaling pathway 


Organismal systems/Endocrine system 


35 


ko03008, Ribosome biogenesis in eukaryotes 


Genetic information processing^ranslation 


35 


ko05016, Huntington's disease 


Human diseases/Neurodegenerative diseases 


34 


ko04146, Peroxisome 


Cellular processes^ransport and catabolism 


34 


ko00561, Glycerolipid metabolism 


Metabolism/Lipid metabolism 


32 


koOSOlO, Alzheimer's disease 


Human diseases/Neurodegenerative diseases 


32 


ko04010, MAPK signaling pathway 


Environmental information processing /Signal transduction 


32 


ko041 11, Cell cycle - yeast 


Cellular processes/Cell growth and death 


32 


ko04144, Endocytosis 


Cellular processes^ransport and catabolism 


31 


ko04810, Regulation of actin cytoskeleton 


Cellular processes/Cell motility 


30 


ko00564, Glycerophospholipid metabolism 


Metabolism/Lipid metabolism 


29 


ko05169, Epstein-Barr virus infection 


Human diseases/Infectious diseases: Viral 


29 


ko00260, Glycine, serine and threonine metabolism 


Metabolism/Amino acid metabolism 


28 


ko04120, Ubiquitin mediated proteolysis 


Genetic information processing/Folding, sorting and degradation 


28 


ko041 13, Meiosis - yeast 


Cellular processes/Cell growth and death 


28 


ko03010, Ribosome 


Genetic information processing^ranslation 


25 


ko04145. Phagosome 


Cellular processes^ransport and catabolism 


25 



The datas marked with boldface were chosen for further analysis. 
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Third, we detected the DEGs annotated in "purine me- 
tabolism" and "pyrimidine metabolism" (Figure 8). For 
DEGs in both pathways, LY294002 exhibited the strongest 
inhibitory effects (Figure 8A). For the two DEGs only in 
"purine metabolism", Raf inhibitor and LY294002 showed 
the strongest inhibitory effects on pde and allc, respec- 
tively (Figure SB). 

In summary, inhibitors of the Ras downstream effectors 
showed inhibitory efforts on different DEGs to varying de- 
grees indicating that both Raf-MAPK and PI3K-TORC1 
pathways are involved in the transcriptional regulation of 
those DEGs. Interestingly, similar results were observed in 
mammalian cells in which Ras is overexpressed or trans- 
formed [27,28]. 



Discussion 

Rasi transcriptionally activates its downstream Raf-MAPK 
and PI3K-T0RC1 pathways 

On a genome-wide scale, the identification of Ras- 
responsive genes has become feasible using different 
transcriptomic tools. For example, subtractive suppres- 
sion hybridization was performed in immortalized, 
non-tumorigenic rat embryo fibroblasts and in Ras- 
transformed cells. The results have shown that many 
DEGs are involved in almost all aspects of cellular 
growth control and cell survival [29]. A microarray 
was conducted in Ras^^ -trdinsformed mouse embry- 
onic fibroblasts, showing that many genes encoding 
cell growth-related proteins are upregulated [30]. The 
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I 0.4 



* 

1 



Figure 4 Verification of transcriptomic results of DEGs in "pathways in cancer", "insulin signaling pathway", and "MARK signaling 
pathway" by qPCR. (A) Common DEGs in all the three pathways. (B) Common DEGs in "pathways in cancer" and "insulin signaling pathway". (C) 
Common DEGs in "pathways in cancer" and "MARK signaling pathway". (D) Common DEG in "insulin signaling pathway" and "MARK signaling 
pathway". The transcriptional results of DEGs (FRKM) are marked with black. The q-RCR data of DEGs upregulated and downregulated by Rasi are 
marked with red and green. The mis-matched DEGs are marked with gray. WT: wildtype; Rasi: Fil-GAL4> UAS-Rasl^^. 
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Figure 5 Verification of transcriptomic results of DEGs only in "pathways in cancer", "insulin signaling pathway", or "MARK signaling 
pathway" by qPCR. DEGs only in "pathways in cancer" (A), "insulin signaling pathway" (B), "MARK signaling pathway" (C). The transcriptional 
results of DEGs (FPKM) are marked with black. The q-PCR data of DEGs upregulated and downregulated by Rasi are marked with red and green. 
The mis-matched DEGs are marked with gray. WT\ wildtype; Rasi: Fil-GAL4 > UAS-Rasl^^. 



results of Ras-induced gene expression profiling stud- 
ies based on subtractive suppression hybridization and 
microarrays were extensively summarized [27], reveal- 
ing that Kruppel-like factor 5, the CD44 antigen, and 
members of the epidermal growth factor (EGR)-family 
are common Ras downstream effectors [28]. Using 
lUumina-Solexa sequencing to analyze DEGs in the 
PSG, here we found that many Rasl-induced genes 
are distributed in "pathways in cancer", "insulin 



signaling", and "MAPK signaling pathway". The tran- 
scriptomic analysis illustrates that, apart from phos- 
phorylational regulation, Rasi can also activate its 
downstream Raf-MAPK and PI3K-TORC1 pathways at 
the transcriptional level (Figure 3, 4 and 5, Additional 
file 1: Figure SI, Additional file 2: Figure S2 and Add- 
itional 3: Figure S3, Table 4). To our knowledge, this 
is the first report that Rasi can transcriptionally acti- 
vate its downstream pathways at a global level. 
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Figure 6 Verification of transcriptomic results of DEGs in "purine metabolism" and "pyrimidine metabolism" by qPCR. (A) Common 
DEGs in both pathways. (B) DEGs only in "purine metabolism". (C) DEGs only in "pyrimidine metabolism". The transcriptional results of DEGs 
(FPKM) are marked with black. The q-PCR data of DEGs upregulated and downregulated by Rasi are marked with red and green. The mis-matched 
DEGs are marked with gray. WT: wildtype; Rasi : Fil-GAL4 > UAS-Rasl^^. 
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Figure 7 Small-molecule chemical inhibitor treatments to identify which Rasi downstream signaling pathways are involved in 
regulating DEGs in "pathways in cancer", "insulin signaling pathway", and "MARK signaling pathway". (A) Common DEGs in all the three 
pathways. (B) Common DEGs in "pathways in cancer" and "insulin signaling pathway". (C) A common DEG in "pathways in cancer" and 
"MARK signaling pathway". (D) DEGs only in "pathways in cancer". (E) DEGs only in "insulin signaling pathway". (F) DEGs only in "MARK signaling 
pathway". Small-molecule chemical inhibitors of the Ras downstream effectors (Rafi, Raf inhibitor; LY294002, RISK inhibitor; Rapa, rapamycin, TORCl 
inhibitor; 15 |jg/larva) are injected into to /^^-ove rex pressed silkworm larvae. Inhibitors are marked with different color: black, DMSO as a control- 
red: Raf inhibitor; green, LY294002; blue: rapamycin. 
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Figure 8 Small-molecule inhibitor treatments to identify which Ras1 downstream signaling pathways are involved in regulating DEGs 
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Rasi transcriptionally activates genes involved in 
nucleotide metabolism and cell cycle for increasing DNA 
content and inducing endoreplication 

Earlier studies have shown that a lot of key enzymes of 
nucleotide metabolism and DNA biosynthesis, such as 
CTP synthetase, thymidylate synthase, dihydrofolate re- 
ductase, IMP dehydrogenase, ribonucleotide reductase, 
DNA polymerase, and DNA methyltransferase, are mark- 
edly upregulated in certain tumor cells, which supports 
the excessive proliferation of transformed cells [31]. The 
microarray conducted in Ras^^ -trdinsformed mouse em- 
bryonic fibroblasts revealed that many genes encoding 
DNA-associated proteins (involved in DNA replication 
and DNA repair) are upregulated as well [30]. Interest- 
ingly, a microarray analysis of Ras -overexpressed hemo- 
cytes in the fruitfly, Drosophila melanogaster, showed that 
a large number of genes that are functionally important in 
cell cycle regulation and DNA replication were upregu- 
lated [32]. We have previously shown that compared to 
the WT PSG, total DNA content is nearly doubled in the 
7?<3^5i^^-overexpressed PSG [2]. Moreover, in comparison 
with the WT PSG, BrdU incorporation in the Rasf-^- 
overexpressed PSG is much higher indicating enhanced 
endoreplicative cycles [2]. In this study, we found many 
Rasl -induced genes are enriched in "purine metabolism", 
"pyrimidine metabolism", and "cell cycle", which ranks top 
1, 6, and 7, respectively (Figure 6, Table 3). Therefore, it is 
likely that Rasl transcriptionally activates genes involved 
in nucleotide metabolism and cell cycle for increasing 
DNA content and inducing endoreplication in the PSG. 

Conclusion 

About 46 years before, it has been hypothesized that fi- 
broin production in the Bombyx PSG is directly propor- 
tional to silk yield and determined by its gland size and 
protein synthesis [1]. Based on this hypothesis, we have 
generated a transgenic silkworm, Fil > Rasl^"^, for im- 
proving silk yield. Importantly, overexpression of Rasl^^ 
increases gland size and protein synthesis in the PSG, 
improving fibroin production and silk yield by 60%. At 
the molecular level, we have determined that Ras activa- 
tion enhances phosphorylation levels of Ras downstream 
effector proteins, Raf and PI3K110, and thus activates its 
downstream Raf-MAPK and PI3K-TORC1 pathways [2]. 
To better understand the molecular mechanisms how 
Rasl^^ overexpression in the PSG improves fibroin pro- 
duction and silk yield, we performed both proteomics 
and transcriptomics. Unfortunately, in spite of the dis- 
covery that Rasl^^ upregulates bcpi to inhibit cathepsin 
activity and thus to prevent PSG destruction, we were 
not able to better understand how Rasl^^ improves fi- 
broin production and silk yield using proteomics [10]. 

The transcriptomic results of the Rasl ^^-overexpressed 
PSG presented here underlie a wide array of DEGs in 



many KEGG pathways. Importantly, we have discovered 
that a large number of DEGs in "pathways in cancer", "in- 
sulin signaling", and "MAPK signaling pathway" are 
upregulated by Rasl^^ overexpression in the PSG. Com- 
bined with our previous findings [2] and the present studies 
(Figure 9), we conclude that Rasl activates its downstream 
Raf-MAPK and PI3K-TORC1 pathways at both phosphory- 
lational and transcriptional levels. Moreover, we find that 
Rasl upregulates genes in "nucleotide metabolism" and 
"cell cycle" for increasing DNA content and inducing 
endoreplication (Figure 9). This study has advanced our 
knowledge on how Rasl^^ overexpression in the PSG im- 
proves fibroin production and silk production. 

Methods 

Animals 

The Bombyx strain, Dazao, was reared on fresh mulberry 
leaves in the laboratory at 25°C under 14 h light/ 10 h 
dark cycles. The transgenic silkworm Fil > Rasl^"*" was 
obtained as described previously by crossing Fil-GAL4 
with UAS-Rasl^^ [2]. The trans genie silkworms were 
reared under the same condition as the WT silkworms. 

Dissection of the PSGs 

The silkworm PSGs were dissected from the Rasl^^- 
overexpressed and WT silkworms at the early wandering 
stage, when silkworms just begin to spin [2,10]. The 
PSGs were used for Illumina-Solexa sequencing and 
qPCR. Throughout the paper, all qPCR experiments 
were performed in 3 biological duplicates. 

RNA extraction 

For Illumina sequencing, total RNA from the Rasl^^- 
overexpressed PSG or the WT PSG was isolated with 
TRIzol (Invitrogen, Carlsbad, CA, USA). To remove any 
residual DNA, samples were pretreated with RNase-free 



Transcriptional / Raf-MAPK 
regulation I PI3K-T0RC1 

Transcriptional 
regulation 

▼ 

Nucleotide metabolism 

Figure 9 A proposed model shows that Rasl*"'^ activates its 
downstream Raf-MAPK and PI3K-T0RC1 pathways at both 
phosphorylational and transcriptional levels and upregulates 
genes in "nucleotide metabolism" for inducing endoreplication. 
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DNase I (New England BioLabs, Ipswich, MA, USA) for 
30 minutes at 37°C. RNA quality was first verified using 
a 2100 Bioanalyzer RNA Nanochip (Agilent, Santa Clara, 
CA) with RNA Integrity Number (RIN) value over 8.5. 
RNA was then quantified using NanoDrop ND-1000 
Spectrophotometer (NanoDrop, Wilmington, DE). 

Library preparation and lllumina sequencing 

The RNA-seq libraries were prepared using Illuminaskit 
following the manufacturers protocol (lllumina, San 
Diego, CA). Approximately 20 [ig of total RNA from the 
Rasl^^ -over expressed PSG or the WT PSG was used to 
isolate mRNA using Sera-mag Magnetic Oligo (dT) 
Beads (lllumina). To avoid cDNA synthesizing bias by 
priming, the purified mRNA was fragmented into small 
pieces (100-400 bp) using divalent cations at 94°C for 
5 minutes. The double-stranded cDNA was first synthe- 
sized using the Superscript Double-Stranded cDNA Syn- 
thesis kit (Invitrogen, Camarillo, CA) with random 
hexamer (N6) primers (lllumina). Then the synthesized 
cDNA was subjected to end-repair and phosphorylation 
using T4 DNA polymerase, Klenow DNA polymerase, 
and T4 PNK. These repaired cDNA fragments were 3' 
adenylated using Klenow Exo- (3' to 5' exo minus, lllu- 
mina). lllumina paired-end adapters were ligated to the 
ends of these 3'-adenylated cDNA fragments.The ligated 
cDNA was then enriched with 15 rounds of PCR ampli- 
fication using PCR Primer PE 1.0 and PCR Primer PE 
2.0 (lllumina) with Phusion DNA Polymerase.The librar- 
ies were sequenced using lllumina Highseq 2000 plat- 
form according to the manufacturers instructions, 
lllumina sequencing was performed at Suzhou Encode 
Genomics Biotechnology Co-Ltd (Encode Genomics; 
Suzhou, China). 

Raw data preprocess 

Preprocessing was carried out with a stringent filtering 
process. First, we removed reads that do not pass the 
built-in Illumina's software Failed-Chastity filter accord- 
ing to the relation "failed-chastity < = 1", using a chastity 
threshold of 0.6, on the first 25 cycles. Second, we dis- 
carded all reads with adaptor contamination. Third, 
we ruled out low-quality reads containing ambiguous 
sequences "N". Finally, the reads with more than 10% 
Q < 20 bases were also removed. 

Genome mapping and abundance analysis 

Quality-filtered reads were then aligned to the Bombyx 
genome (release_2.0, ftp://silkdb.org/pub/current/Gen- 
ome/ by TopHat (version 2.0.4) [20] with the parameters 
"-bowtiel -r 0 -mate-std-dev 50 -N 3-solexal.3-quals" 
(insert size is set as 0). The resulting alignment data 
from Tophat were then fed to an assembler Cufflinks 
(version 2.0.1) to assemble aligned RNA-Seq reads into 



silkworm genome database (silkworm_glean.gff) and 
silkworm gene database (silkgenome.fa). Unigene abun- 
dances were measured by Fragments per kb of exon per 
million fragments mapped (FPKM) using the formula 
FPKM = (1,000,000^^0/ (N^^Ln,000)[21]. 

Functional annotations 

The DEGs in Rasl^^-over expressed and WT PSGs were 
functional annotated by GO annotation and KEGG an- 
notation. For GO annotation, the DEGs were first 
blasted against uniprot knowledgebase (including Swiss- 
Prot and TrEMBL) (UniProtKB; http://www.uniprot.org) 
to get uniprot IDs. Then the uniprot IDs were assigned 
to GO terms at three basic catergories including mo- 
lecular function, biological process, and cellular compo- 
nent. For KEGG annotation, DEGs were functionally 
annotated with KAAS (KEGG Automatic Annotation 
Server) by BLAST comparisons against the manually cu- 
rated KEGG GENES database. The result contains KO 
(KEGG Orthology) assignments and automatically gen- 
erated KEGG pathways. 

qPCR 

Total RNA of the i?(3^5i^^-overexpressed PSG or the WT 
PSG was extracted using TRIzol (Invitrogen). qPCR was 
performed as previously described [2,33]. The primers 
used in this paper are listed in Additional file 6: Table SI 
(Supporting Information). 

Chemical inhibitor treatment 

Small-molecule chemical inhibitors of the Ras downstream 
effectors (Rafi, Raf inhibitor, Santa Cruz; LY294002, PI3K 
inhibitor, Sigma; rapamycin, TORCl inhibitor, Sigma; 
15 (ig/larva) were injected into the Fil > Rasl^"^ larvae 
at the EW stage. At 24 h after the first injection, the PSGs 
were dissected for qPCR as previously described [10]. 

Statistics 

Experimental data were analyzed with the Students ^-test 
and ANOVA. ^-test: \ p < 0.05; *^ p < 0.01. ANOVA: the 
bars labeled with different lowercase letters are signifi- 
cantly different (p < 0.05). Throughout the paper, values 
are represented as the mean ± standard deviation of at 
least 3 independent experiments. 



Availability of supporting data 

The RNA-seq raw data were deposited to NCBI SRA 
with the accession number SRP026709 (http://www.ncbi. 
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Additional file 1: Figure SI. DEGs distributed in "pathways in cancer" 
(KEGG) signaling network. DEGs are marked with red square. 

Additional file 2: Figure S2. DEGs distributed in "insulin signaling 
pathway" (KEGG) signaling network. DEGs are marked with red square. 

Additional file 3: Figure S3. DEGs distributed in "MARK signaling 
pathway" (KEGG) signaling network. DEGs are marked with red square. 

Additional file 4: Figure S4. DEGs distributed in "purine metabolism" 
(KEGG) signaling network. DEGs are marked with red square. 

Additional file 5: Figure S5. ADEGs distributed in "pyrimidine 
metabolism" (KEGG) signaling network. DEGs are marked with red square. 

Additional file 6: Table SI. List of primers used for qPCR verification. 



